applying natural language processing techniques for effective persian- english cross-language information retrieval

نویسندگان

h. alizadeh ph.d. , regional information center for science & technology

r. fattahi ph.d. , ferdowsi university of mashhad

m. r. davarpanah ph. d. , ferdowsi university of mashhad

چکیده

much attention has recently been paid to natural language processing in information storage and retrieval. this paper describes how the application of natural language processing ( nlp ) techniques can enhance cross-language information retrieval ( clir ). using a semi-experimental technique, we took farsi queries to retrieve relevant documents in english. for translating persian queries, we used a bilingual machinereadable dictionary. nlp techniques such as tokenization, morphological analysis and part of speech tagging were used in pre-and- post translation phases. results showed that applying nlp techniques yields more effective clir performance.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Applying Natural Language Processing Techniques for Effective Persian- English Cross-Language Information Retrieval

Much attention has recently been paid to natural language processing in information storage and retrieval. This paper describes how the application of natural language processing (NLP) techniques can enhance cross-language information retrieval (CLIR). Using a semi-experimental technique, we took Farsi queries to retrieve relevant documents in English. For translating Persian queries, we used a...

متن کامل

creating appropriate corpus for information retrieval and natural language processing in persian language

persian natural language processing (nlp) researchers have many limitations to access linguistic tools which are suitable for text processing. therefore, researchin persian text processing is very limited. since dataset is an important requirement for experiments and their evaluation, we aimed to create appropriate corpora for information retrieval and natural language processing in persian. th...

متن کامل

Applying Light Natural Language Processing to Ad-Hoc Cross Language Information Retrieval

In the CLEF 2005 Ad-Hoc Track we experimented with language-specific morphosyntactic processing and light Natural Language Processing (NLP) for the retrieval of Bulgarian, French, Italian, English and Greek.

متن کامل

Arabic Natural Language Processing for Information Retrieval

Human Language Technology has played a big role in implementing Latin based information retrieval systems. Two of the most sited techniques are stemming and truncation. Numerous studies have showed that the inflectional structure of words has a big impact on the retrieval accuracy of Latin-based languages information retrieval systems (IRS). Stemming or truncation is done for two principal reas...

متن کامل

On Arabic-English Cross-Language Information Retrieval:

A Machine Translation (MT) system is an automatic process that translates from one human language to another language by using context information. We evaluate the use of an MT-based approach for query translation in an Arabic-English Cross-Language Information Retrieval (CLIR) system. We empirically evaluate the use of an MT-based approach for query translation in an Arabic-English CLIR system...

متن کامل

Applying Natural Language Processing

In this paper, we discuss the application of Natural Language Processing (NLP) techniques to improving speech prostheses for people with severe motor disabilities. Many people who are unable to speak because of physical disability utilize text-to-speech generators as prosthetic devices. However, users of speech prosthe-ses very often have more general loss of motor control and, despite aids suc...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
international journal of information science and management

جلد ۸، شماره ۲، صفحات ۸۹-۹۸

کلمات کلیدی

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023